Overview
Other statistics
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 2874 |
| Missing cells | 583 |
| Missing cells (%) | 1.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 15.9 MiB |
| Average record size in memory | 5.7 KiB |
Variable types
| Categorical | 6 |
|---|---|
| Numeric | 12 |
title has a high cardinality: 1255 distinct values | High cardinality |
id_thread has a high cardinality: 2865 distinct values | High cardinality |
tokens has a high cardinality: 2454 distinct values | High cardinality |
interlocutors has a high cardinality: 1766 distinct values | High cardinality |
dates has a high cardinality: 2170 distinct values | High cardinality |
n_posts is highly correlated with n_interlocutors and 7 other fields | High correlation |
n_interlocutors is highly correlated with n_posts and 7 other fields | High correlation |
mean_post_per_interlocutor is highly correlated with n_posts and 6 other fields | High correlation |
mean_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 6 other fields | High correlation |
max_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 6 other fields | High correlation |
n_tokens is highly correlated with n_posts and 8 other fields | High correlation |
mean_tokens is highly correlated with n_tokens and 4 other fields | High correlation |
min_tokens is highly correlated with n_posts and 3 other fields | High correlation |
max_tokens is highly correlated with n_posts and 8 other fields | High correlation |
n_tokens_stopwords is highly correlated with n_posts and 8 other fields | High correlation |
mean_tokens_stopwords is highly correlated with n_tokens and 4 other fields | High correlation |
n_posts is highly correlated with n_interlocutors and 5 other fields | High correlation |
n_interlocutors is highly correlated with n_posts and 5 other fields | High correlation |
n_anonymes is highly correlated with n_posts and 4 other fields | High correlation |
mean_post_per_interlocutor is highly correlated with n_tokens_stopwords | High correlation |
mean_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 4 other fields | High correlation |
max_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 4 other fields | High correlation |
n_tokens is highly correlated with n_posts and 6 other fields | High correlation |
mean_tokens is highly correlated with min_tokens and 2 other fields | High correlation |
min_tokens is highly correlated with mean_tokens and 1 other fields | High correlation |
max_tokens is highly correlated with n_tokens and 3 other fields | High correlation |
n_tokens_stopwords is highly correlated with n_posts and 4 other fields | High correlation |
mean_tokens_stopwords is highly correlated with mean_tokens and 2 other fields | High correlation |
n_posts is highly correlated with n_interlocutors and 5 other fields | High correlation |
n_interlocutors is highly correlated with n_posts and 5 other fields | High correlation |
mean_post_per_interlocutor is highly correlated with n_posts and 5 other fields | High correlation |
mean_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 5 other fields | High correlation |
max_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 5 other fields | High correlation |
n_tokens is highly correlated with n_posts and 8 other fields | High correlation |
mean_tokens is highly correlated with n_tokens and 3 other fields | High correlation |
max_tokens is highly correlated with n_tokens and 3 other fields | High correlation |
n_tokens_stopwords is highly correlated with n_posts and 8 other fields | High correlation |
mean_tokens_stopwords is highly correlated with n_tokens and 3 other fields | High correlation |
n_posts is highly correlated with n_interlocutors and 5 other fields | High correlation |
n_interlocutors is highly correlated with n_posts and 5 other fields | High correlation |
n_anonymes is highly correlated with n_posts and 5 other fields | High correlation |
mean_post_per_interlocutor is highly correlated with n_tokens and 1 other fields | High correlation |
mean_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 5 other fields | High correlation |
max_post_per_interlocutor_with_anonymous is highly correlated with n_posts and 5 other fields | High correlation |
n_tokens is highly correlated with n_posts and 7 other fields | High correlation |
mean_tokens is highly correlated with min_tokens and 2 other fields | High correlation |
min_tokens is highly correlated with mean_tokens and 2 other fields | High correlation |
max_tokens is highly correlated with n_tokens and 4 other fields | High correlation |
n_tokens_stopwords is highly correlated with n_posts and 7 other fields | High correlation |
mean_tokens_stopwords is highly correlated with mean_tokens and 2 other fields | High correlation |
title has 583 (20.3%) missing values | Missing |
n_posts is highly skewed (γ1 = 21.42687354) | Skewed |
n_interlocutors is highly skewed (γ1 = 21.42687354) | Skewed |
n_anonymes is highly skewed (γ1 = 40.68570103) | Skewed |
mean_post_per_interlocutor_with_anonymous is highly skewed (γ1 = 48.14186938) | Skewed |
max_post_per_interlocutor_with_anonymous is highly skewed (γ1 = 39.66133588) | Skewed |
id_thread is uniformly distributed | Uniform |
n_anonymes has 1166 (40.6%) zeros | Zeros |
mean_post_per_interlocutor has 1007 (35.0%) zeros | Zeros |
mean_post_per_interlocutor_with_anonymous has 86 (3.0%) zeros | Zeros |
n_tokens has 62 (2.2%) zeros | Zeros |
mean_tokens has 62 (2.2%) zeros | Zeros |
min_tokens has 358 (12.5%) zeros | Zeros |
max_tokens has 62 (2.2%) zeros | Zeros |
n_tokens_stopwords has 59 (2.1%) zeros | Zeros |
mean_tokens_stopwords has 59 (2.1%) zeros | Zeros |
Reproduction
| Analysis started | 2021-11-18 15:53:33.289417 |
|---|---|
| Analysis finished | 2021-11-18 15:55:14.399389 |
| Duration | 1 minute and 41.11 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
| Distinct | 1255 |
|---|---|
| Distinct (%) | 54.8% |
| Missing | 583 |
| Missing (%) | 20.3% |
| Memory size | 209.2 KiB |
| Discussions | |
|---|---|
| Avis | |
| Supprimer | 153 |
| Avis non décomptés | 131 |
| Conserver | 90 |
| Other values (1250) |
Length
| Max length | 147 |
|---|---|
| Median length | 17 |
| Mean length | 19.95024007 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1189 ? |
|---|---|
| Unique (%) | 51.9% |
Sample
| 1st row | Bandeaux à foison |
|---|---|
| 2nd row | Ton de l'article |
| 3rd row | Proposition de fusion entre [[Industrie de la houille blanche]] et [[Houille blanche]] |
| 4th row | mon article industries de la houille blanche en Maurienne |
| 5th row | articles enrichis |
Common Values
| Value | Count | Frequency (%) |
| Discussions | 186 | 6.5% |
| Avis | 163 | 5.7% |
| Supprimer | 153 | 5.3% |
| Avis non décomptés | 131 | 4.6% |
| Conserver | 90 | 3.1% |
| Fichier proposé à la suppression sur Commons | 65 | 2.3% |
| Liens externes modifiés | 51 | 1.8% |
| Votes | 25 | 0.9% |
| Neutre | 18 | 0.6% |
| Avis divers non décomptés | 16 | 0.6% |
| Other values (1245) | 1393 | |
| (Missing) | 583 |
Length
| Value | Count | Frequency (%) |
| avis | 315 | 4.8% |
| de | 300 | 4.6% |
| discussions | 193 | 3.0% |
| 186 | 2.9% | |
| la | 166 | 2.6% |
| non | 159 | 2.4% |
| supprimer | 156 | 2.4% |
| décomptés | 147 | 2.3% |
| à | 127 | 2.0% |
| et | 104 | 1.6% |
| Other values (2271) | 4646 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 2865 |
|---|---|
| Distinct (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 185.4 KiB |
| 2927500_2 | 2 |
|---|---|
| 58110_4 | 2 |
| 2141430_2 | 2 |
| 413290_23 | 2 |
| 3094690_2 | 2 |
| Other values (2860) |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 9.011134308 |
| Min length | 7 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 2856 ? |
|---|---|
| Unique (%) | 99.4% |
Sample
| 1st row | 11324890_1 |
|---|---|
| 2nd row | 11324890_2 |
| 3rd row | 11324890_3 |
| 4th row | 11324890_4 |
| 5th row | 11324890_5 |
Common Values
| Value | Count | Frequency (%) |
| 2927500_2 | 2 | 0.1% |
| 58110_4 | 2 | 0.1% |
| 2141430_2 | 2 | 0.1% |
| 413290_23 | 2 | 0.1% |
| 3094690_2 | 2 | 0.1% |
| 1309500_2 | 2 | 0.1% |
| 1881180_2 | 2 | 0.1% |
| 6329110_4 | 2 | 0.1% |
| 1277240_2 | 2 | 0.1% |
| 10482000_2 | 1 | < 0.1% |
| Other values (2855) | 2855 |
Length
| Value | Count | Frequency (%) |
| 2927500_2 | 2 | 0.1% |
| 2141430_2 | 2 | 0.1% |
| 413290_23 | 2 | 0.1% |
| 3094690_2 | 2 | 0.1% |
| 1309500_2 | 2 | 0.1% |
| 1881180_2 | 2 | 0.1% |
| 6329110_4 | 2 | 0.1% |
| 1277240_2 | 2 | 0.1% |
| 58110_4 | 2 | 0.1% |
| 1166590_1 | 1 | < 0.1% |
| Other values (2855) | 2855 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 60 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.453723034 |
| Minimum | 1 |
|---|---|
| Maximum | 437 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 15 |
| Maximum | 437 |
| Range | 436 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 11.52131977 |
|---|---|
| Coefficient of variation (CV) | 2.58689633 |
| Kurtosis | 723.0871271 |
| Mean | 4.453723034 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.42687354 |
| Sum | 12800 |
| Variance | 132.7408093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1481 | |
| 2 | 349 | 12.1% |
| 3 | 197 | 6.9% |
| 4 | 131 | 4.6% |
| 5 | 116 | 4.0% |
| 7 | 79 | 2.7% |
| 6 | 72 | 2.5% |
| 8 | 72 | 2.5% |
| 9 | 52 | 1.8% |
| 11 | 50 | 1.7% |
| Other values (50) | 275 | 9.6% |
| Value | Count | Frequency (%) |
| 1 | 1481 | |
| 2 | 349 | 12.1% |
| 3 | 197 | 6.9% |
| 4 | 131 | 4.6% |
| 5 | 116 | 4.0% |
| 6 | 72 | 2.5% |
| 7 | 79 | 2.7% |
| 8 | 72 | 2.5% |
| 9 | 52 | 1.8% |
| 10 | 44 | 1.5% |
| Value | Count | Frequency (%) |
| 437 | 1 | |
| 190 | 1 | |
| 129 | 1 | |
| 78 | 1 | |
| 74 | 2 | |
| 73 | 1 | |
| 71 | 1 | |
| 64 | 1 | |
| 63 | 1 | |
| 60 | 1 |
| Distinct | 2454 |
|---|---|
| Distinct (%) | 85.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.4 MiB |
| ['discussion', 'ci-dessous'] | 130 |
|---|---|
| [] | 62 |
| ['message', 'déposer', 'automatiquement', 'robot'] | 52 |
| ['exception', 'faire', 'créateur', 'article', 'avis', 'utilisateur', 'récemment', 'inscrire', 'contribution', 'non', 'identifiable', 'IP', 'opinion', 'non', 'signer', 'principe', 'prendre', 'compte', 'être', 'cas', 'pouvoir', 'toutefois', 'participer', 'discussion', 'exprimer', 'ci-dessous', 'information'] | 48 |
| ['exception', 'faire', 'créateur', 'article', 'avis', 'utilisateur', 'récemment', 'inscrire', 'contribution', 'non', 'identifiable', 'IP', 'principe', 'prendre', 'compte', 'être', 'cas', 'pouvoir', 'toutefois', 'participer', 'discussion', 'exprimer', 'ci-dessous', 'information'] | 39 |
| Other values (2449) |
Length
| Max length | 53764 |
|---|---|
| Median length | 300 |
| Mean length | 867.4822547 |
| Min length | 2 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 2418 ? |
|---|---|
| Unique (%) | 84.1% |
Sample
| 1st row | ['rarement', 'article', 'fournir', 'bandeau', 'relever', 'article', 'respecte', 'doute', 'guère', 'grammaire', 'wikipédienn', 'lucky', 'Luke', 'bandeau', 'encombrer', 'guère', 'bon', 'usage', 'absence', 'débat', 'pdd', 'regrettable', 'y', 'page', 'fort', 'utile', 'bref', 'article', 'éminemment', 'perfectible', 'forme', 'diversité', 'sourçage', 'sauver', 'avis', 'Bonjour', 'poseur', 'bandeau', 'article', 'admissibilité', 'article', 'préciser', 'chose', 'considérer', 'lucky', 'Luke', 'bandeau', 'répondre', 'bandeau', 'poser', 'Copyvio', 'contributeur', 'remercier', 'rédaction', 'fournir', 'lien', 'accès', 'ressource', 'lien', 'accès', 'ressource', 'thèse', '559', 'page', 'y', 'copier', 'coller', 'section', 'ti', 'Complétons', 'fin', 'ri', 'Wikipédia', 'travail', 'inédit', 'opinion', 'excessivement', 'minoritaire', 'associer', 'source', 'juger', 'confidentiel', 'fiable', 'voire', 'simplement', 'interprétation', 'déduction', 'intuition', 'personnel', 'rédacteur', 'article', 'exemple', 'section', 'évolution', 'jour', 'facteur', 'jouer', 'faveur', 'vallée', 'fin', 'xix', 'siècle', 'retourner', 'inconvénient', 'disparaître', 'rente', 'énergétique', 'commander', 'couplage', 'usine', 'centrale', 'hydroélectrique', 'section', 'glorieux', 'conglomérat', 'omniprésent', 'vallée', 'falloir', 'oublier', 'envergure', 'non', 'national', 'exercer', 'stratégie', 'champ', 'mondial', 'doute', 'solidité', 'ancrage', 'nord-alpin', 'section', '1914', '1939', 'mention', 'spécial', 'faire', 'usine', 'Epierre', 'bas', 'Maurienne', 'four', 'électrique', 'origine', 'jusqu’', 'fermeture', 'consacrer', 'fabrication', 'dérivé', 'phosphore', 'agir', 'relocalisation', 'firme', 'Coignet', 'origine', 'lyonnais', 'terme', 'pérégrination', 'bandeau', 'sourçage', 'article', 'mérite', 'certainement', 'mieux', 'source', 'bien', 'qualité', 'unique', 'source', 'fête', '40', 'an', 'poser', 'problématique', 'mise', 'jour', 'également', 'partisan', 'sauver', 'article', 'contenu', 'favorable', 'fusion', 'toilettage', 'profond', 'article', 'Cdt', 'vrai', 'accumulation', 'bandeau', 'interpelle', 'fond', 'article', 'souffrir', 'bel', 'bien', 'multiple', 'problème', 'source', 'wikifier', 'ti', 'mesure', 'appuyer', 'thèse', 'ancien', 'important', 'essayer', 'expliquer', 'patiemment', 'auteur', 'manifestement', 'spécialiste', 'usage', 'encyclopédie', 'monde', 'sorte', 'gagnant', 'bonjour', 'bref', 'commencer', 'renommer', 'article', 'houille', 'blanc', 'Maurienne', 'trouver', 'source', 'secondaire', 'solide', 'traiter', 'sujet', 'résoudre', 'problème', 'signaler', 'voir', 'y', 'lieu', 'non', 'conserver', 'article', 'moment', 'page', 'apparaître', 'non', 'ti', 'copyvio', 'simple', 'fiche', 'lecture', 'résumé', 'thèse', 'Louis', 'Chabert', 'exception', 'dernier', 'section', 'constitue', 'sujet', 'admissible', 'absence', 'source', 'secondaire', 'thèse', 'caractère', 'simple', 'résumé', 'non', 'admissible', 'source', 'secondaire', 'indépendant', 'évaluer', 'thèse', 'ailleurs', 'confirmer', 'https://fr.wikipedia.org/w/index.php?title=industrie_de_la_houille_blanchediff=143234225oldid=143234146', 'commentaire', 'création', 'article'] |
|---|---|
| 2nd row | ['déplacement', 'bandeau', 'ti', 'section', 'douteux', 'qualifierez', 'vous', 'trop', 'enjouer', 'promotionnel', 'passage', 'citer', 'haut', 'page', 'réponse', 'bonjour', 'bien', 'problème', 'source', 'unique', 'évite', 'difficilement', 'générer', 'manque', 'neutralité', 'ici', 'haut', 'affaire', 'page', 'promouvoir', 'thèse', 'Louis', 'Chabert', 'constitue', 'sorte', 'fiche', 'lecture', 'non', 'critique', 'faute', 'source', 'secondaire', 'analyser', 'évaluer', 'thèse', 'souligner', 'thèse', 'Louis', 'Chabert', 'largement', 'financer', 'Péchiney', 'Ugine', 'Kuhlmann', 'voir', 'lien', 'indiquer', 'haut', 'expliquer', 'tendance', 'article', 'favoriser', 'touche', 'activité', 'groupe', 'très', 'présenter', 'article', 'conclusion', 'manque', 'neutralité', 'doute', 'problème', 'essentiel', 'article', 'créer', 'Bonjour', 'oui', 'dsl', 'on', 'demander', 'suppr', 'article', 'y', 'amélioration', 'bout', 'temps', 'coopération', 'Louis', 'Chabert', 'lacune', 'voir', 'état', 'paf', 'aboutir', 'argument', 'd', 'rrr_utilisateur', 'chabert', 'louis_rrr', 'chabert', 'Louis', 'actif', 'intervention', 'effectivement', 'nécessaire', 'assurer', 'conservation', 'article', 'bonjour', 'falloir', 'temps', 'lancer', 'procédure', 'suppression', 'rien', 'venir', 'falloir', 'remplacer', 'bandeau', 'info', 'cf.', 'https://fr.wikipedia.org/wiki/wikip%c3%a9dia:requ%c3%aate_aux_administrateursti_manifeste', '_', '1', 'immédiatement', 'création', 'bandeau', 'admis', 'poser', 'chabert', 'Louis', 'venir', 'réagir', 'rrr_utilisateur', 'chabert', 'louis_rrr', 'demande', 'temps', 'devoir', 'donner', 'article', 'potentiel', 'plaisir', 'lire'] |
| 3rd row | ['discussion', 'transférer', 'Wikipédia', 'page', 'fusionner', 'Bonjour', 'propose', 'retirer', 'jour', 'bandeau', 'fusion', 'archiver', 'pdd', 'respectif', 'procédure', 'ad', 'hoc', 'oui', 'RRR_Utilisateur', 'ot38_rrr', 'OT38', 'fusion', 'finalement', 'procédure', 'approprié', 'discussion', 'mérite', 'conserver', 'falloir', 'statuer', 'nuée', 'bandeau', 'copivio', 'admissibilité', 'etc.', 'historique', 'auteur', 'article', 'également', 'auteur', 'thèse', 'probable', 'thèse', '559', 'page', 'résume', 'grâce', 'copier', 'coller', 'bref', 'admissibilité', 'discute', 'pdd', 'fusion', 'contenu', 'évidence', 'gêne', 'beaucoup', 'article', 'industrie', 'houille', 'blanc', 'état', 'actuel', 'devoir', 'titrer', 'houille', 'blanc', 'Maurienne', 'coup', 'voir', 'bien', 'fusionner', 'article', 'appuyer', 'ailleurs', 'unique', 'source', 'contraire', 'demande', 'Wikipédia', 'risque', 'déséquilibrer', 'totalement', 'article', 'houille', 'blanc', 'vocation', 'traiter', 'industrie', 'houille', 'blanc', 'totalité', 'monde', 'Maurienne', 'France', 'Union', 'européen', 'monde', 'entier', 'moment', 'guère', 'ébauche', 'article', 'industrie', 'houille', 'blanc', 'Maurienne', 'commencer', 'démontrer', 'admissibilité', 'éventuel', 'renommage', 'résolution', 'problème', 'vouloir', 'insister', 'lancer', 'procédure', 'suppression', 'mieux', 'clôturer', 'procédure', 'lancer', 'réagir'] |
| 4th row | ['ajouter', 'référence', 'bibliographique', 'choix', 'arbitraire', 'difficulté', 'préférer', 'reporter', 'place', 'référence', 'thèse', 'résulte', 'référencer', 'fois', 'devenue', 'inutile', 'papeterie', 'charge', 'supprimer', 'dernier', 'part', '23', 'souvenir', 'bien', 'proposer', 'face', 'texte', 'gauche', 'écran', 'traitement', 'cosmétique', 'propre', 'terme', 'devoir', 'je', 'soumettre', 'texte', 'traitement', 'affaire', 'suppose', 'avoir', 'revoir', 'ensemble', 'texte', 'esprit', 'fois', 'opération', 'terminer', 'rester', '-t', 'il', 'faire', 'mettre', 'article', 'conformité', 'code', 'wikipedia', 'souhaite', 'bien', 'sûr', 'voir', 'bout', 'aide', 'zzznote', 'type', 'unsigned_non', 'signé|chabert', 'Louis|27', 'janvier', '2018', '17:02', 'CET)|144917352|notif='] |
| 5th row | ['enrichir', 'article', 'commune', 'Maurienne', 'orelle', 'saint-etienne-de-cuine', 'créer', 'nouveau', 'source', 'article', 'houille', 'blanche--'] |
Common Values
| Value | Count | Frequency (%) |
| ['discussion', 'ci-dessous'] | 130 | 4.5% |
| [] | 62 | 2.2% |
| ['message', 'déposer', 'automatiquement', 'robot'] | 52 | 1.8% |
| ['exception', 'faire', 'créateur', 'article', 'avis', 'utilisateur', 'récemment', 'inscrire', 'contribution', 'non', 'identifiable', 'IP', 'opinion', 'non', 'signer', 'principe', 'prendre', 'compte', 'être', 'cas', 'pouvoir', 'toutefois', 'participer', 'discussion', 'exprimer', 'ci-dessous', 'information'] | 48 | 1.7% |
| ['exception', 'faire', 'créateur', 'article', 'avis', 'utilisateur', 'récemment', 'inscrire', 'contribution', 'non', 'identifiable', 'IP', 'principe', 'prendre', 'compte', 'être', 'cas', 'pouvoir', 'toutefois', 'participer', 'discussion', 'exprimer', 'ci-dessous', 'information'] | 39 | 1.4% |
| ['exception', 'faire', 'créateur', 'article', 'avis', 'utilisateur', 'inscrire', 'contribution', 'non', 'identifiable', 'IP', 'principe', 'prendre', 'compte', 'être', 'cas', 'pouvoir', 'toutefois', 'participer', 'discussion', 'exprimer', 'ci-dessous', 'information'] | 30 | 1.0% |
| ['exception', 'faire', 'créateur', 'article', 'avis', 'utilisateur', 'récemment', 'inscrire', 'contribution', 'non', 'identifiable', 'IPs', 'opinion', 'non', 'signer', 'principe', 'décompter', 'être', 'cas', 'pouvoir', 'toutefois', 'participer', 'discussion', 'exprimer', 'ci-dessous', 'information'] | 13 | 0.5% |
| ['message', 'déposer'] | 10 | 0.3% |
| ['_', '_', 'noinde', '_', '_', 'instruction'] | 8 | 0.3% |
| ['terme', 'tour'] | 5 | 0.2% |
| Other values (2444) | 2477 |
Length
| Value | Count | Frequency (%) |
| article | 5217 | 2.4% |
| 2651 | 1.2% | |
| source | 2073 | 0.9% |
| y | 1584 | 0.7% |
| non | 1577 | 0.7% |
| faire | 1575 | 0.7% |
| bien | 1509 | 0.7% |
| avis | 1333 | 0.6% |
| page | 1283 | 0.6% |
| discussion | 1235 | 0.6% |
| Other values (19738) | 198998 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 1766 |
|---|---|
| Distinct (%) | 61.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 355.0 KiB |
| ['anonyme'] | |
|---|---|
| ['bot'] | 86 |
| ['anonyme', 'anonyme'] | 32 |
| ['ℳ𝒄𝓛𝒖𝒔𝒉FR'] | 7 |
| ['Azurfrog'] | 7 |
| Other values (1761) |
Length
| Max length | 4807 |
|---|---|
| Median length | 18 |
| Mean length | 55.2466945 |
| Min length | 7 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 1694 ? |
|---|---|
| Unique (%) | 58.9% |
Sample
| 1st row | ['Borvan53', 'anonyme', 'anonyme', 'OT38', 'Binabik', 'Azurfrog'] |
|---|---|
| 2nd row | ['OT38', 'OT38', 'Azurfrog', 'OT38', 'Azurfrog', 'Borvan53', 'Azurfrog', 'OT38', 'Borvan53'] |
| 3rd row | ['anonyme', 'OT38', 'Borvan53', 'Borvan53', 'Azurfrog', 'Azurfrog', 'OT38', 'Nouill'] |
| 4th row | ['bot'] |
| 5th row | ['CHABERT Louis'] |
Common Values
| Value | Count | Frequency (%) |
| ['anonyme'] | 878 | |
| ['bot'] | 86 | 3.0% |
| ['anonyme', 'anonyme'] | 32 | 1.1% |
| ['ℳ𝒄𝓛𝒖𝒔𝒉FR'] | 7 | 0.2% |
| ['Azurfrog'] | 7 | 0.2% |
| ['Gemini1980'] | 6 | 0.2% |
| ['Patrick Rogel'] | 6 | 0.2% |
| ['Valérie75'] | 5 | 0.2% |
| ['anonyme', 'anonyme', 'anonyme'] | 5 | 0.2% |
| ['Christophe95'] | 4 | 0.1% |
| Other values (1756) | 1838 |
Length
| Value | Count | Frequency (%) |
| anonyme | 3706 | 25.1% |
| liege | 126 | 0.9% |
| chris | 126 | 0.9% |
| a | 126 | 0.9% |
| schlum | 116 | 0.8% |
| elnon | 114 | 0.8% |
| bot | 104 | 0.7% |
| patrick | 94 | 0.6% |
| rogel | 94 | 0.6% |
| unsigned | 93 | 0.6% |
| Other values (1922) | 10041 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
n_interlocutors
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 60 |
|---|---|
| Distinct (%) | 2.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.453723034 |
| Minimum | 1 |
|---|---|
| Maximum | 437 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 4 |
| 95-th percentile | 15 |
| Maximum | 437 |
| Range | 436 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 11.52131977 |
|---|---|
| Coefficient of variation (CV) | 2.58689633 |
| Kurtosis | 723.0871271 |
| Mean | 4.453723034 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 21.42687354 |
| Sum | 12800 |
| Variance | 132.7408093 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1481 | |
| 2 | 349 | 12.1% |
| 3 | 197 | 6.9% |
| 4 | 131 | 4.6% |
| 5 | 116 | 4.0% |
| 7 | 79 | 2.7% |
| 6 | 72 | 2.5% |
| 8 | 72 | 2.5% |
| 9 | 52 | 1.8% |
| 11 | 50 | 1.7% |
| Other values (50) | 275 | 9.6% |
| Value | Count | Frequency (%) |
| 1 | 1481 | |
| 2 | 349 | 12.1% |
| 3 | 197 | 6.9% |
| 4 | 131 | 4.6% |
| 5 | 116 | 4.0% |
| 6 | 72 | 2.5% |
| 7 | 79 | 2.7% |
| 8 | 72 | 2.5% |
| 9 | 52 | 1.8% |
| 10 | 44 | 1.5% |
| Value | Count | Frequency (%) |
| 437 | 1 | |
| 190 | 1 | |
| 129 | 1 | |
| 78 | 1 | |
| 74 | 2 | |
| 73 | 1 | |
| 71 | 1 | |
| 64 | 1 | |
| 63 | 1 | |
| 60 | 1 |
| Distinct | 2170 |
|---|---|
| Distinct (%) | 75.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 365.5 KiB |
| [None] | |
|---|---|
| [None, None] | 16 |
| [None, None, None] | 6 |
| ['2017-05-18T02:36'] | 3 |
| [None, '2011-03-13T17:02', '2011-03-13T17:02', None, '2011-03-13T15:14', '2011-03-18T22:00', '2011-03-13T12:29', '2011-03-13T15:14', '2011-03-18T22:00'] | 2 |
| Other values (2165) |
Length
| Max length | 2622 |
|---|---|
| Median length | 20 |
| Mean length | 73.17466945 |
| Min length | 6 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 2157 ? |
|---|---|
| Unique (%) | 75.1% |
Sample
| 1st row | ['2017-12-06T15:56', None, None, '2017-12-06T16:48', '2017-12-07T20:57', '2017-12-12T09:18'] |
|---|---|
| 2nd row | ['2017-12-12T09:52', None, '2017-12-12T10:04', '2017-12-12T10:15', None, '2017-12-12T12:15', '2017-12-12T12:37', '2017-12-12T12:52', '2017-12-12T23:21'] |
| 3rd row | [None, '2017-12-18T14:45', '2017-12-19T10:30', '2017-12-06T15:44', '2017-12-12T09:30', '2017-12-12T08:57', '2017-12-12T09:03', '2017-12-23T21:26'] |
| 4th row | ['2018-01-27T17:02'] |
| 5th row | ['2018-02-02T22:11'] |
Common Values
| Value | Count | Frequency (%) |
| [None] | 674 | 23.5% |
| [None, None] | 16 | 0.6% |
| [None, None, None] | 6 | 0.2% |
| ['2017-05-18T02:36'] | 3 | 0.1% |
| [None, '2011-03-13T17:02', '2011-03-13T17:02', None, '2011-03-13T15:14', '2011-03-18T22:00', '2011-03-13T12:29', '2011-03-13T15:14', '2011-03-18T22:00'] | 2 | 0.1% |
| ['2018-09-08T00:31'] | 2 | 0.1% |
| ['2010-08-06T13:13'] | 2 | 0.1% |
| ['2008-04-13T10:34'] | 2 | 0.1% |
| ['2017-09-05T14:43'] | 2 | 0.1% |
| ['2008-06-26T03:11', '2008-06-26T04:23', '2008-06-26T08:14'] | 2 | 0.1% |
| Other values (2160) | 2163 |
Length
| Value | Count | Frequency (%) |
| none | 3264 | 25.5% |
| 2010-01-25t00:46 | 27 | 0.2% |
| 2012-10-21t18:10 | 27 | 0.2% |
| 2016-12-20t00:41 | 15 | 0.1% |
| 2007-11-23t00:14 | 9 | 0.1% |
| 2016-12-20t01:27 | 9 | 0.1% |
| 2014-11-18t22:08 | 9 | 0.1% |
| 2011-07-01t10:33 | 9 | 0.1% |
| 2012-10-24t02:26 | 7 | 0.1% |
| 2006-03-20t23:50 | 6 | < 0.1% |
| Other values (5450) | 9418 |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
| Distinct | 27 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.289491997 |
| Minimum | 0 |
|---|---|
| Maximum | 437 |
| Zeros | 1166 |
| Zeros (%) | 40.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 3 |
| Maximum | 437 |
| Range | 437 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 9.159778728 |
|---|---|
| Coefficient of variation (CV) | 7.103400989 |
| Kurtosis | 1845.181063 |
| Mean | 1.289491997 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 40.68570103 |
| Sum | 3706 |
| Variance | 83.90154635 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1196 | |
| 0 | 1166 | |
| 2 | 233 | 8.1% |
| 3 | 147 | 5.1% |
| 4 | 60 | 2.1% |
| 5 | 22 | 0.8% |
| 6 | 10 | 0.3% |
| 7 | 9 | 0.3% |
| 8 | 7 | 0.2% |
| 10 | 6 | 0.2% |
| Other values (17) | 18 | 0.6% |
| Value | Count | Frequency (%) |
| 0 | 1166 | |
| 1 | 1196 | |
| 2 | 233 | 8.1% |
| 3 | 147 | 5.1% |
| 4 | 60 | 2.1% |
| 5 | 22 | 0.8% |
| 6 | 10 | 0.3% |
| 7 | 9 | 0.3% |
| 8 | 7 | 0.2% |
| 9 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 437 | 1 | |
| 189 | 1 | |
| 52 | 1 | |
| 49 | 1 | |
| 46 | 1 | |
| 31 | 1 | |
| 27 | 1 | |
| 26 | 1 | |
| 25 | 1 | |
| 24 | 1 |
n_bots
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 162.9 KiB |
| 0 | |
|---|---|
| 1 | 94 |
| 2 | 5 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 ? |
| Distinct scripts | 0 ? |
| Distinct blocks | 0 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2775 | |
| 1 | 94 | 3.3% |
| 2 | 5 | 0.2% |
Length
Pie chart
| Value | Count | Frequency (%) |
| 0 | 2775 | |
| 1 | 94 | 3.3% |
| 2 | 5 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | ||
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | ||
Most frequent character per block
mean_post_per_interlocutor
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 116 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.8543294176 |
| Minimum | 0 |
|---|---|
| Maximum | 10.25 |
| Zeros | 1007 |
| Zeros (%) | 35.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 2.142857143 |
| Maximum | 10.25 |
| Range | 10.25 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 0.8824121992 |
|---|---|
| Coefficient of variation (CV) | 1.03287114 |
| Kurtosis | 14.72205936 |
| Mean | 0.8543294176 |
| Median Absolute Deviation (MAD) | 0.3333333333 |
| Skewness | 2.51616118 |
| Sum | 2455.342746 |
| Variance | 0.7786512892 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1278 | |
| 0 | 1007 | |
| 2 | 103 | 3.6% |
| 1.5 | 88 | 3.1% |
| 1.333333333 | 40 | 1.4% |
| 1.25 | 30 | 1.0% |
| 3 | 23 | 0.8% |
| 1.4 | 16 | 0.6% |
| 2.5 | 16 | 0.6% |
| 1.666666667 | 15 | 0.5% |
| Other values (106) | 258 | 9.0% |
| Value | Count | Frequency (%) |
| 0 | 1007 | |
| 1 | 1278 | |
| 1.029411765 | 1 | < 0.1% |
| 1.052631579 | 2 | 0.1% |
| 1.0625 | 1 | < 0.1% |
| 1.066666667 | 1 | < 0.1% |
| 1.071428571 | 2 | 0.1% |
| 1.076923077 | 1 | < 0.1% |
| 1.083333333 | 2 | 0.1% |
| 1.090909091 | 7 | 0.2% |
| Value | Count | Frequency (%) |
| 10.25 | 1 | |
| 9.5 | 1 | |
| 7.333333333 | 1 | |
| 7 | 2 | |
| 6.8 | 1 | |
| 6.5 | 1 | |
| 6.2 | 1 | |
| 6 | 1 | |
| 5.8 | 1 | |
| 5.666666667 | 1 |
mean_post_per_interlocutor_with_anonymous
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWEDZEROS| Distinct | 136 |
|---|---|
| Distinct (%) | 4.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.479514499 |
| Minimum | 0 |
|---|---|
| Maximum | 437 |
| Zeros | 86 |
| Zeros (%) | 3.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 1.25 |
| 95-th percentile | 2.463888889 |
| Maximum | 437 |
| Range | 437 |
| Interquartile range (IQR) | 0.25 |
Descriptive statistics
| Standard deviation | 8.459169808 |
|---|---|
| Coefficient of variation (CV) | 5.71753086 |
| Kurtosis | 2453.668134 |
| Mean | 1.479514499 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 48.14186938 |
| Sum | 4252.124671 |
| Variance | 71.55755384 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 1940 | |
| 2 | 127 | 4.4% |
| 1.5 | 119 | 4.1% |
| 0 | 86 | 3.0% |
| 1.25 | 55 | 1.9% |
| 1.333333333 | 54 | 1.9% |
| 1.666666667 | 38 | 1.3% |
| 3 | 27 | 0.9% |
| 1.6 | 27 | 0.9% |
| 1.4 | 26 | 0.9% |
| Other values (126) | 375 | 13.0% |
| Value | Count | Frequency (%) |
| 0 | 86 | 3.0% |
| 1 | 1940 | |
| 1.038461538 | 1 | < 0.1% |
| 1.0625 | 2 | 0.1% |
| 1.071428571 | 1 | < 0.1% |
| 1.083333333 | 4 | 0.1% |
| 1.090909091 | 4 | 0.1% |
| 1.1 | 12 | 0.4% |
| 1.111111111 | 3 | 0.1% |
| 1.125 | 6 | 0.2% |
| Value | Count | Frequency (%) |
| 437 | 1 | |
| 95 | 1 | |
| 52 | 1 | |
| 46 | 1 | |
| 24 | 1 | |
| 14.6 | 1 | |
| 14 | 1 | |
| 10.66666667 | 1 | |
| 10 | 1 | |
| 9.8 | 1 |
max_post_per_interlocutor_with_anonymous
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONSKEWED| Distinct | 30 |
|---|---|
| Distinct (%) | 1.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.049756437 |
| Minimum | 1 |
|---|---|
| Maximum | 437 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 5 |
| Maximum | 437 |
| Range | 436 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 9.222505442 |
|---|---|
| Coefficient of variation (CV) | 4.499317712 |
| Kurtosis | 1782.396093 |
| Mean | 2.049756437 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 39.66133588 |
| Sum | 5891 |
| Variance | 85.05460662 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 2023 | |
| 2 | 385 | 13.4% |
| 3 | 209 | 7.3% |
| 4 | 106 | 3.7% |
| 5 | 40 | 1.4% |
| 6 | 28 | 1.0% |
| 7 | 18 | 0.6% |
| 8 | 13 | 0.5% |
| 10 | 10 | 0.3% |
| 9 | 7 | 0.2% |
| Other values (20) | 35 | 1.2% |
| Value | Count | Frequency (%) |
| 1 | 2023 | |
| 2 | 385 | 13.4% |
| 3 | 209 | 7.3% |
| 4 | 106 | 3.7% |
| 5 | 40 | 1.4% |
| 6 | 28 | 1.0% |
| 7 | 18 | 0.6% |
| 8 | 13 | 0.5% |
| 9 | 7 | 0.2% |
| 10 | 10 | 0.3% |
| Value | Count | Frequency (%) |
| 437 | 1 | |
| 189 | 1 | |
| 52 | 1 | |
| 49 | 1 | |
| 46 | 1 | |
| 38 | 1 | |
| 31 | 1 | |
| 27 | 1 | |
| 26 | 1 | |
| 25 | 1 |
| Distinct | 367 |
|---|---|
| Distinct (%) | 12.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 76.19102296 |
| Minimum | 0 |
|---|---|
| Maximum | 5604 |
| Zeros | 62 |
| Zeros (%) | 2.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7 |
| median | 26 |
| Q3 | 70 |
| 95-th percentile | 290.35 |
| Maximum | 5604 |
| Range | 5604 |
| Interquartile range (IQR) | 63 |
Descriptive statistics
| Standard deviation | 201.8889962 |
|---|---|
| Coefficient of variation (CV) | 2.649774059 |
| Kurtosis | 258.0328356 |
| Mean | 76.19102296 |
| Median Absolute Deviation (MAD) | 21 |
| Skewness | 12.5526167 |
| Sum | 218973 |
| Variance | 40759.16677 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 178 | 6.2% |
| 7 | 151 | 5.3% |
| 6 | 118 | 4.1% |
| 4 | 91 | 3.2% |
| 27 | 74 | 2.6% |
| 24 | 72 | 2.5% |
| 0 | 62 | 2.2% |
| 5 | 60 | 2.1% |
| 8 | 55 | 1.9% |
| 10 | 53 | 1.8% |
| Other values (357) | 1960 |
| Value | Count | Frequency (%) |
| 0 | 62 | 2.2% |
| 1 | 21 | 0.7% |
| 2 | 178 | |
| 3 | 47 | 1.6% |
| 4 | 91 | |
| 5 | 60 | 2.1% |
| 6 | 118 | |
| 7 | 151 | |
| 8 | 55 | 1.9% |
| 9 | 49 | 1.7% |
| Value | Count | Frequency (%) |
| 5604 | 1 | |
| 3376 | 1 | |
| 3354 | 1 | |
| 2108 | 1 | |
| 2006 | 1 | |
| 1851 | 1 | |
| 1667 | 1 | |
| 1513 | 1 | |
| 1337 | 1 | |
| 1252 | 1 |
mean_tokens
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 739 |
|---|---|
| Distinct (%) | 25.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 18.00225457 |
| Minimum | 0 |
|---|---|
| Maximum | 282 |
| Zeros | 62 |
| Zeros (%) | 2.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 6 |
| median | 12 |
| Q3 | 23.075 |
| 95-th percentile | 51 |
| Maximum | 282 |
| Range | 282 |
| Interquartile range (IQR) | 17.075 |
Descriptive statistics
| Standard deviation | 21.84035879 |
|---|---|
| Coefficient of variation (CV) | 1.213201308 |
| Kurtosis | 33.9727512 |
| Mean | 18.00225457 |
| Median Absolute Deviation (MAD) | 7.5 |
| Skewness | 4.55091528 |
| Sum | 51738.47963 |
| Variance | 477.0012721 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 184 | 6.4% |
| 7 | 164 | 5.7% |
| 6 | 121 | 4.2% |
| 4 | 103 | 3.6% |
| 5 | 71 | 2.5% |
| 0 | 62 | 2.2% |
| 27 | 60 | 2.1% |
| 24 | 59 | 2.1% |
| 8 | 56 | 1.9% |
| 9 | 56 | 1.9% |
| Other values (729) | 1938 |
| Value | Count | Frequency (%) |
| 0 | 62 | |
| 0.1666666667 | 1 | < 0.1% |
| 0.1818181818 | 1 | < 0.1% |
| 0.1842105263 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.6666666667 | 1 | < 0.1% |
| 0.7142857143 | 1 | < 0.1% |
| 1 | 20 | 0.7% |
| 1.2 | 1 | < 0.1% |
| 1.5 | 3 | 0.1% |
| Value | Count | Frequency (%) |
| 282 | 1 | |
| 277 | 1 | |
| 245 | 1 | |
| 224 | 1 | |
| 223 | 1 | |
| 193 | 1 | |
| 192 | 1 | |
| 181 | 1 | |
| 176 | 2 | |
| 163 | 1 |
| Distinct | 107 |
|---|---|
| Distinct (%) | 3.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11.77139875 |
| Minimum | 0 |
|---|---|
| Maximum | 282 |
| Zeros | 358 |
| Zeros (%) | 12.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 14 |
| 95-th percentile | 40 |
| Maximum | 282 |
| Range | 282 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 19.99635199 |
|---|---|
| Coefficient of variation (CV) | 1.698723526 |
| Kurtosis | 39.29017005 |
| Mean | 11.77139875 |
| Median Absolute Deviation (MAD) | 4 |
| Skewness | 4.995187932 |
| Sum | 33831 |
| Variance | 399.8540929 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 358 | 12.5% |
| 1 | 338 | 11.8% |
| 2 | 338 | 11.8% |
| 7 | 199 | 6.9% |
| 6 | 158 | 5.5% |
| 4 | 151 | 5.3% |
| 3 | 147 | 5.1% |
| 5 | 131 | 4.6% |
| 9 | 74 | 2.6% |
| 8 | 73 | 2.5% |
| Other values (97) | 907 |
| Value | Count | Frequency (%) |
| 0 | 358 | |
| 1 | 338 | |
| 2 | 338 | |
| 3 | 147 | |
| 4 | 151 | |
| 5 | 131 | 4.6% |
| 6 | 158 | |
| 7 | 199 | |
| 8 | 73 | 2.5% |
| 9 | 74 | 2.6% |
| Value | Count | Frequency (%) |
| 282 | 1 | |
| 245 | 1 | |
| 224 | 1 | |
| 193 | 1 | |
| 192 | 1 | |
| 181 | 1 | |
| 176 | 2 | |
| 163 | 1 | |
| 150 | 1 | |
| 145 | 1 |
max_tokens
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 165 |
|---|---|
| Distinct (%) | 5.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.01704941 |
| Minimum | 0 |
|---|---|
| Maximum | 436 |
| Zeros | 62 |
| Zeros (%) | 2.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 7 |
| median | 21 |
| Q3 | 35 |
| 95-th percentile | 95 |
| Maximum | 436 |
| Range | 436 |
| Interquartile range (IQR) | 28 |
Descriptive statistics
| Standard deviation | 37.98593803 |
|---|---|
| Coefficient of variation (CV) | 1.265478745 |
| Kurtosis | 21.56568885 |
| Mean | 30.01704941 |
| Median Absolute Deviation (MAD) | 14 |
| Skewness | 3.71489921 |
| Sum | 86269 |
| Variance | 1442.931488 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 183 | 6.4% |
| 2 | 179 | 6.2% |
| 27 | 148 | 5.1% |
| 6 | 140 | 4.9% |
| 24 | 121 | 4.2% |
| 4 | 101 | 3.5% |
| 23 | 78 | 2.7% |
| 5 | 74 | 2.6% |
| 18 | 66 | 2.3% |
| 9 | 65 | 2.3% |
| Other values (155) | 1719 |
| Value | Count | Frequency (%) |
| 0 | 62 | 2.2% |
| 1 | 23 | 0.8% |
| 2 | 179 | |
| 3 | 53 | 1.8% |
| 4 | 101 | |
| 5 | 74 | |
| 6 | 140 | |
| 7 | 183 | |
| 8 | 58 | 2.0% |
| 9 | 65 | 2.3% |
| Value | Count | Frequency (%) |
| 436 | 1 | < 0.1% |
| 403 | 1 | < 0.1% |
| 372 | 1 | < 0.1% |
| 338 | 1 | < 0.1% |
| 300 | 4 | |
| 294 | 1 | < 0.1% |
| 292 | 1 | < 0.1% |
| 289 | 1 | < 0.1% |
| 282 | 1 | < 0.1% |
| 264 | 1 | < 0.1% |
n_tokens_stopwords
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 576 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167.6002088 |
| Minimum | 0 |
|---|---|
| Maximum | 8121 |
| Zeros | 59 |
| Zeros (%) | 2.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 4 |
| Q1 | 12 |
| median | 53 |
| Q3 | 149 |
| 95-th percentile | 660.1 |
| Maximum | 8121 |
| Range | 8121 |
| Interquartile range (IQR) | 137 |
Descriptive statistics
| Standard deviation | 422.6445119 |
|---|---|
| Coefficient of variation (CV) | 2.521742157 |
| Kurtosis | 117.593642 |
| Mean | 167.6002088 |
| Median Absolute Deviation (MAD) | 46 |
| Skewness | 8.828213253 |
| Sum | 481683 |
| Variance | 178628.3834 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 5 | 184 | 6.4% |
| 7 | 182 | 6.3% |
| 6 | 92 | 3.2% |
| 50 | 63 | 2.2% |
| 0 | 59 | 2.1% |
| 52 | 45 | 1.6% |
| 49 | 44 | 1.5% |
| 3 | 34 | 1.2% |
| 8 | 30 | 1.0% |
| 66 | 26 | 0.9% |
| Other values (566) | 2115 |
| Value | Count | Frequency (%) |
| 0 | 59 | 2.1% |
| 1 | 16 | 0.6% |
| 2 | 24 | 0.8% |
| 3 | 34 | 1.2% |
| 4 | 17 | 0.6% |
| 5 | 184 | |
| 6 | 92 | |
| 7 | 182 | |
| 8 | 30 | 1.0% |
| 9 | 21 | 0.7% |
| Value | Count | Frequency (%) |
| 8121 | 1 | |
| 7494 | 1 | |
| 6613 | 1 | |
| 5228 | 1 | |
| 4694 | 1 | |
| 4506 | 1 | |
| 3615 | 1 | |
| 3274 | 1 | |
| 3105 | 1 | |
| 3012 | 1 |
mean_tokens_stopwords
Real number (ℝ≥0)
HIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONHIGH CORRELATIONZEROS| Distinct | 941 |
|---|---|
| Distinct (%) | 32.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.0787808 |
| Minimum | 0 |
|---|---|
| Maximum | 651.5 |
| Zeros | 59 |
| Zeros (%) | 2.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 22.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3.144444444 |
| Q1 | 8 |
| median | 24.95454545 |
| Q3 | 50 |
| 95-th percentile | 114.675 |
| Maximum | 651.5 |
| Range | 651.5 |
| Interquartile range (IQR) | 42 |
Descriptive statistics
| Standard deviation | 47.87027887 |
|---|---|
| Coefficient of variation (CV) | 1.257137909 |
| Kurtosis | 32.51514078 |
| Mean | 38.0787808 |
| Median Absolute Deviation (MAD) | 17.95454545 |
| Skewness | 4.39676692 |
| Sum | 109438.416 |
| Variance | 2291.563599 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 187 | 6.5% |
| 5 | 183 | 6.4% |
| 6 | 98 | 3.4% |
| 50 | 61 | 2.1% |
| 0 | 59 | 2.1% |
| 52 | 42 | 1.5% |
| 49 | 38 | 1.3% |
| 8 | 36 | 1.3% |
| 3 | 36 | 1.3% |
| 13 | 32 | 1.1% |
| Other values (931) | 2102 |
| Value | Count | Frequency (%) |
| 0 | 59 | |
| 0.1666666667 | 1 | < 0.1% |
| 0.1818181818 | 1 | < 0.1% |
| 0.4105263158 | 1 | < 0.1% |
| 0.5 | 1 | < 0.1% |
| 0.6666666667 | 1 | < 0.1% |
| 0.7142857143 | 1 | < 0.1% |
| 1 | 14 | 0.5% |
| 2 | 23 | 0.8% |
| 2.5 | 4 | 0.1% |
| Value | Count | Frequency (%) |
| 651.5 | 1 | |
| 555.5 | 1 | |
| 544 | 1 | |
| 468 | 1 | |
| 448 | 1 | |
| 446 | 1 | |
| 424 | 1 | |
| 381 | 1 | |
| 365 | 2 | |
| 343.5 | 1 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
| title | id_thread | n_posts | tokens | interlocutors | n_interlocutors | dates | n_anonymes | n_bots | mean_post_per_interlocutor | mean_post_per_interlocutor_with_anonymous | max_post_per_interlocutor_with_anonymous | n_tokens | mean_tokens | min_tokens | max_tokens | n_tokens_stopwords | mean_tokens_stopwords | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Bandeaux à foison | 11324890_1 | 6 | ['rarement', 'article', 'fournir', 'bandeau', 'relever', 'article', 'respecte', 'doute', 'guère', 'grammaire', 'wikipédienn', 'lucky', 'Luke', 'bandeau', 'encombrer', 'guère', 'bon', 'usage', 'absence', 'débat', 'pdd', 'regrettable', 'y', 'page', 'fort', 'utile', 'bref', 'article', 'éminemment', 'perfectible', 'forme', 'diversité', 'sourçage', 'sauver', 'avis', 'Bonjour', 'poseur', 'bandeau', 'article', 'admissibilité', 'article', 'préciser', 'chose', 'considérer', 'lucky', 'Luke', 'bandeau', 'répondre', 'bandeau', 'poser', 'Copyvio', 'contributeur', 'remercier', 'rédaction', 'fournir', 'lien', 'accès', 'ressource', 'lien', 'accès', 'ressource', 'thèse', '559', 'page', 'y', 'copier', 'coller', 'section', 'ti', 'Complétons', 'fin', 'ri', 'Wikipédia', 'travail', 'inédit', 'opinion', 'excessivement', 'minoritaire', 'associer', 'source', 'juger', 'confidentiel', 'fiable', 'voire', 'simplement', 'interprétation', 'déduction', 'intuition', 'personnel', 'rédacteur', 'article', 'exemple', 'section', 'évolution', 'jour', 'facteur', 'jouer', 'faveur', 'vallée', 'fin', 'xix', 'siècle', 'retourner', 'inconvénient', 'disparaître', 'rente', 'énergétique', 'commander', 'couplage', 'usine', 'centrale', 'hydroélectrique', 'section', 'glorieux', 'conglomérat', 'omniprésent', 'vallée', 'falloir', 'oublier', 'envergure', 'non', 'national', 'exercer', 'stratégie', 'champ', 'mondial', 'doute', 'solidité', 'ancrage', 'nord-alpin', 'section', '1914', '1939', 'mention', 'spécial', 'faire', 'usine', 'Epierre', 'bas', 'Maurienne', 'four', 'électrique', 'origine', 'jusqu’', 'fermeture', 'consacrer', 'fabrication', 'dérivé', 'phosphore', 'agir', 'relocalisation', 'firme', 'Coignet', 'origine', 'lyonnais', 'terme', 'pérégrination', 'bandeau', 'sourçage', 'article', 'mérite', 'certainement', 'mieux', 'source', 'bien', 'qualité', 'unique', 'source', 'fête', '40', 'an', 'poser', 'problématique', 'mise', 'jour', 'également', 'partisan', 'sauver', 'article', 'contenu', 'favorable', 'fusion', 'toilettage', 'profond', 'article', 'Cdt', 'vrai', 'accumulation', 'bandeau', 'interpelle', 'fond', 'article', 'souffrir', 'bel', 'bien', 'multiple', 'problème', 'source', 'wikifier', 'ti', 'mesure', 'appuyer', 'thèse', 'ancien', 'important', 'essayer', 'expliquer', 'patiemment', 'auteur', 'manifestement', 'spécialiste', 'usage', 'encyclopédie', 'monde', 'sorte', 'gagnant', 'bonjour', 'bref', 'commencer', 'renommer', 'article', 'houille', 'blanc', 'Maurienne', 'trouver', 'source', 'secondaire', 'solide', 'traiter', 'sujet', 'résoudre', 'problème', 'signaler', 'voir', 'y', 'lieu', 'non', 'conserver', 'article', 'moment', 'page', 'apparaître', 'non', 'ti', 'copyvio', 'simple', 'fiche', 'lecture', 'résumé', 'thèse', 'Louis', 'Chabert', 'exception', 'dernier', 'section', 'constitue', 'sujet', 'admissible', 'absence', 'source', 'secondaire', 'thèse', 'caractère', 'simple', 'résumé', 'non', 'admissible', 'source', 'secondaire', 'indépendant', 'évaluer', 'thèse', 'ailleurs', 'confirmer', 'https://fr.wikipedia.org/w/index.php?title=industrie_de_la_houille_blanchediff=143234225oldid=143234146', 'commentaire', 'création', 'article'] | ['Borvan53', 'anonyme', 'anonyme', 'OT38', 'Binabik', 'Azurfrog'] | 6 | ['2017-12-06T15:56', None, None, '2017-12-06T16:48', '2017-12-07T20:57', '2017-12-12T09:18'] | 2 | 0 | 1.00 | 1.2 | 2 | 278 | 46.333333 | 29 | 65 | 635 | 105.833333 |
| 1 | Ton de l'article | 11324890_2 | 9 | ['déplacement', 'bandeau', 'ti', 'section', 'douteux', 'qualifierez', 'vous', 'trop', 'enjouer', 'promotionnel', 'passage', 'citer', 'haut', 'page', 'réponse', 'bonjour', 'bien', 'problème', 'source', 'unique', 'évite', 'difficilement', 'générer', 'manque', 'neutralité', 'ici', 'haut', 'affaire', 'page', 'promouvoir', 'thèse', 'Louis', 'Chabert', 'constitue', 'sorte', 'fiche', 'lecture', 'non', 'critique', 'faute', 'source', 'secondaire', 'analyser', 'évaluer', 'thèse', 'souligner', 'thèse', 'Louis', 'Chabert', 'largement', 'financer', 'Péchiney', 'Ugine', 'Kuhlmann', 'voir', 'lien', 'indiquer', 'haut', 'expliquer', 'tendance', 'article', 'favoriser', 'touche', 'activité', 'groupe', 'très', 'présenter', 'article', 'conclusion', 'manque', 'neutralité', 'doute', 'problème', 'essentiel', 'article', 'créer', 'Bonjour', 'oui', 'dsl', 'on', 'demander', 'suppr', 'article', 'y', 'amélioration', 'bout', 'temps', 'coopération', 'Louis', 'Chabert', 'lacune', 'voir', 'état', 'paf', 'aboutir', 'argument', 'd', 'rrr_utilisateur', 'chabert', 'louis_rrr', 'chabert', 'Louis', 'actif', 'intervention', 'effectivement', 'nécessaire', 'assurer', 'conservation', 'article', 'bonjour', 'falloir', 'temps', 'lancer', 'procédure', 'suppression', 'rien', 'venir', 'falloir', 'remplacer', 'bandeau', 'info', 'cf.', 'https://fr.wikipedia.org/wiki/wikip%c3%a9dia:requ%c3%aate_aux_administrateursti_manifeste', '_', '1', 'immédiatement', 'création', 'bandeau', 'admis', 'poser', 'chabert', 'Louis', 'venir', 'réagir', 'rrr_utilisateur', 'chabert', 'louis_rrr', 'demande', 'temps', 'devoir', 'donner', 'article', 'potentiel', 'plaisir', 'lire'] | ['OT38', 'OT38', 'Azurfrog', 'OT38', 'Azurfrog', 'Borvan53', 'Azurfrog', 'OT38', 'Borvan53'] | 9 | ['2017-12-12T09:52', None, '2017-12-12T10:04', '2017-12-12T10:15', None, '2017-12-12T12:15', '2017-12-12T12:37', '2017-12-12T12:52', '2017-12-12T23:21'] | 0 | 0 | 3.00 | 3.0 | 4 | 145 | 16.111111 | 1 | 60 | 331 | 36.777778 |
| 2 | Proposition de fusion entre [[Industrie de la houille blanche]] et [[Houille blanche]] | 11324890_3 | 8 | ['discussion', 'transférer', 'Wikipédia', 'page', 'fusionner', 'Bonjour', 'propose', 'retirer', 'jour', 'bandeau', 'fusion', 'archiver', 'pdd', 'respectif', 'procédure', 'ad', 'hoc', 'oui', 'RRR_Utilisateur', 'ot38_rrr', 'OT38', 'fusion', 'finalement', 'procédure', 'approprié', 'discussion', 'mérite', 'conserver', 'falloir', 'statuer', 'nuée', 'bandeau', 'copivio', 'admissibilité', 'etc.', 'historique', 'auteur', 'article', 'également', 'auteur', 'thèse', 'probable', 'thèse', '559', 'page', 'résume', 'grâce', 'copier', 'coller', 'bref', 'admissibilité', 'discute', 'pdd', 'fusion', 'contenu', 'évidence', 'gêne', 'beaucoup', 'article', 'industrie', 'houille', 'blanc', 'état', 'actuel', 'devoir', 'titrer', 'houille', 'blanc', 'Maurienne', 'coup', 'voir', 'bien', 'fusionner', 'article', 'appuyer', 'ailleurs', 'unique', 'source', 'contraire', 'demande', 'Wikipédia', 'risque', 'déséquilibrer', 'totalement', 'article', 'houille', 'blanc', 'vocation', 'traiter', 'industrie', 'houille', 'blanc', 'totalité', 'monde', 'Maurienne', 'France', 'Union', 'européen', 'monde', 'entier', 'moment', 'guère', 'ébauche', 'article', 'industrie', 'houille', 'blanc', 'Maurienne', 'commencer', 'démontrer', 'admissibilité', 'éventuel', 'renommage', 'résolution', 'problème', 'vouloir', 'insister', 'lancer', 'procédure', 'suppression', 'mieux', 'clôturer', 'procédure', 'lancer', 'réagir'] | ['anonyme', 'OT38', 'Borvan53', 'Borvan53', 'Azurfrog', 'Azurfrog', 'OT38', 'Nouill'] | 8 | [None, '2017-12-18T14:45', '2017-12-19T10:30', '2017-12-06T15:44', '2017-12-12T09:30', '2017-12-12T08:57', '2017-12-12T09:03', '2017-12-23T21:26'] | 1 | 0 | 1.75 | 1.6 | 2 | 125 | 15.625000 | 0 | 57 | 293 | 36.625000 |
| 3 | mon article industries de la houille blanche en Maurienne | 11324890_4 | 1 | ['ajouter', 'référence', 'bibliographique', 'choix', 'arbitraire', 'difficulté', 'préférer', 'reporter', 'place', 'référence', 'thèse', 'résulte', 'référencer', 'fois', 'devenue', 'inutile', 'papeterie', 'charge', 'supprimer', 'dernier', 'part', '23', 'souvenir', 'bien', 'proposer', 'face', 'texte', 'gauche', 'écran', 'traitement', 'cosmétique', 'propre', 'terme', 'devoir', 'je', 'soumettre', 'texte', 'traitement', 'affaire', 'suppose', 'avoir', 'revoir', 'ensemble', 'texte', 'esprit', 'fois', 'opération', 'terminer', 'rester', '-t', 'il', 'faire', 'mettre', 'article', 'conformité', 'code', 'wikipedia', 'souhaite', 'bien', 'sûr', 'voir', 'bout', 'aide', 'zzznote', 'type', 'unsigned_non', 'signé|chabert', 'Louis|27', 'janvier', '2018', '17:02', 'CET)|144917352|notif='] | ['bot'] | 1 | ['2018-01-27T17:02'] | 0 | 1 | 0.00 | 0.0 | 1 | 72 | 72.000000 | 72 | 72 | 156 | 156.000000 |
| 4 | articles enrichis | 11324890_5 | 1 | ['enrichir', 'article', 'commune', 'Maurienne', 'orelle', 'saint-etienne-de-cuine', 'créer', 'nouveau', 'source', 'article', 'houille', 'blanche--'] | ['CHABERT Louis'] | 1 | ['2018-02-02T22:11'] | 0 | 0 | 1.00 | 1.0 | 1 | 12 | 12.000000 | 12 | 12 | 23 | 23.000000 |
| 5 | None | 6656380_1 | 1 | ['avertissement', 'Homonymie', '|', 'revisionid=116606866', '|', 'tharsi', '--'] | ['anonyme'] | 1 | ['2015-08-08T00:46'] | 1 | 0 | 0.00 | 1.0 | 1 | 7 | 7.000000 | 7 | 7 | 7 | 7.000000 |
| 6 | None | 8038650_1 | 1 | ['Sourcer', 'jamais', 'nationalité', 'italien', 'malgré', 'mari', 'italien', 'enfant', 'italien', 'carrière', 'acteur', 'cinéma', 'théâtre', 'italien', 'sourcer', 'acquisition', 'nationalité', 'italien'] | ['anonyme'] | 1 | [None] | 1 | 0 | 0.00 | 1.0 | 1 | 18 | 18.000000 | 18 | 18 | 35 | 35.000000 |
| 7 | Revoil | 6358260_1 | 6 | ['Bonjour', 'suite', 'recherche', 'généalogique', 'famille', 'possession', 'acte', 'mariage', 'peintre', 'Pierre', 'Revoil', 'naître', '12/06/1776', 'Lyon', 'directeur', 'beau', 'art', 'Lyon', 'épouser', 'Joséphine', 'Henriette', 'Révoil', 'nièce', 'mineure', 'Aix-en-Provence', '14', 'janvier', '1816', 'sœur', 'aîné', 'Louise', 'acte', 'disponible', 'ligne', 'site', 'archive', 'départemental', 'bouche', 'Rhône', 'précise', 'note', '1', 'père', 'Louise', 'frère', 'natif', 'Lyon', 'directeur', 'poste', 'Aix-en-Provence', 'épouser', 'Henriette', 'Leblanc', 'Servanne', 'fille', 'Jean', 'Baptiste', 'Benoit', 'Leblanc', 'Servanne', '1738', '1822', 'Marguerite', 'Rousseau', 'héritier', 'père', 'chateau', 'Servannes', 'situer', 'pied', 'Alpilles', 'campagne', 'village', 'Mouriès', '25', 'kilomètre', 'Arles', '60', 'kilomètre', 'Ouest', 'Aix', 'Bonjour', 'renseignement', 'très', 'intéressant', 'savoir', 'peut-être', 'principe', 'Wikipédia', 'repose', 'source', 'fiable', 'figure', 'article', 'absolument', 'publier', 'manière', 'fiable', 'ailleurs', 'savoir', 'vous', 'existe', 'ouvrage', 'reprendre', 'information', 'cordialement', 'Bonjour', 'principal', 'référence', 'Louise', 'Colet', 'Joseph', 'S.', 'Jackson', 'Louise', 'Colet', 'ami', 'littéraire', 'Yales', 'Romanic', '1937', 'information', 'publier', 'page', 'Pierre', 'Révoil', 'Louise', 'Colet', 'issu', 'archive', 'bdr', 'y', 'accès', 'direct', 'page', 'concerner', 'vouloir', 'vérifier', 'cliquer', 'http://doris.archives13.fr/dorisuec/jsp/system/win_main.jsp', 'choisir', 'Aix', 'registre', 'paroissial', 'état', 'civil', 'rechercher', 'nouveau', 'page', 'cliquer', 'mariage', '“', 'entrer', '1816', 'fois', 'case', 'nouveau', 'cliquer', 'bouton', 'rechercher', 'être', 'registre', 'vouloir', 'falloir', 'aller', 'page', '98', '99.--', 'oui', 'régulièrement', 'site', 'AD13', 'problème', 'source', 'dire', 'source', 'primaire', 'acceptable', 'Wikipédia', 'soumettre', 'analyse', 'critique', 'renvoyer', 'détail', 'page', 'Wikipédia', 'source', 'primaire', 'secondaire', 'idéal', 'texte', 'publier', 'sujet', 'référence', 'avoir', 'citer', 'intéressant', 'cordialement', 'bonsoir', 'commentaire', 'source', 'primaire', 'secondaire', 'cas', 'venir', 'information', 'Louise', 'Colet', 'sœur', 'Pierre', 'Révoil', 'document', 'archive', 'disponible', 'belle-sœur', 'Pierre', 'Révoil', 'fille', 'benjamin', 'cousin', 'vérifier', 'apparaître', 'Joseph', 'S.', 'Jackson', 'Louise', 'Colet', 'ami', 'littéraire', 'Yale', 'Romanic', 'xv', '1937', 'courant', 'bien', 'cordialement', 'Nella', 'scheda', 'è', 'scritto', 'ch', 'figlia', 'non', 'è', 'stater', 'riconosciuter', 'marito', 'però', 'sul', 'sito', 'del', 'comune', 'di', 'Parigi', 'risulta', 'una', 'Colet', 'Henriette', 'Suzanne', 'nata', '16', 'luglio', '1840', 'quindi', 'meno', 'di', 'disconoscimento', 'successivo', 'di', 'paternità', 'bambina', 'è', 'stata', 'registrata', 'con', 'cognome', 'Colet', 'http://canadp-archivesenligne.paris.fr/archives_etat_civil/avant_1860_fichiers_etat_civil_reconstitue/fecr_visu_img.php?registre=v3e_n_0518type=ecrfbdd_en_cours=etat_civil_rec_fichiersvue_tranche_debut=ad075er_5mi20785_00604_cvue_tranche_fin=ad075er_5mi20785_00653_cref_histo=55684cote=v3e/n', '518'] | ['Claude.martin', 'Malost', 'Claude.martin', 'Malost', 'Claude.martin', 'anonyme'] | 6 | ['2012-06-12T10:42', '2012-06-12T10:58', '2012-06-12T13:17', '2012-06-12T13:25', '2012-06-12T23:43', None] | 1 | 0 | 2.50 | 2.0 | 3 | 279 | 46.500000 | 25 | 81 | 511 | 85.166667 |
| 8 | « de Servannes » | 6358260_2 | 1 | ['Bonjour', 'quelle(s', 'source(s', 'provenir', 'Louise', 'naître', 'Révoil'] | ['anonyme'] | 1 | [None] | 1 | 0 | 0.00 | 1.0 | 1 | 7 | 7.000000 | 7 | 7 | 12 | 12.000000 |
| 9 | None | 1855970_1 | 1 | ['renommer', 'critère', 'métrisabilité'] | ['Anne Bauval'] | 1 | ['2010-06-15T21:03'] | 0 | 0 | 1.00 | 1.0 | 1 | 3 | 3.000000 | 3 | 3 | 7 | 7.000000 |
Last rows
| title | id_thread | n_posts | tokens | interlocutors | n_interlocutors | dates | n_anonymes | n_bots | mean_post_per_interlocutor | mean_post_per_interlocutor_with_anonymous | max_post_per_interlocutor_with_anonymous | n_tokens | mean_tokens | min_tokens | max_tokens | n_tokens_stopwords | mean_tokens_stopwords | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2864 | [[gothisme]] et [[mouvement gothique]] | 4884870_5 | 9 | ['page', 'gothisme', 'créer', 'récemment', 'sujet', 'controverse', 'page', 'discussion', 'création', 'discuter', 'Gothisme', 'peut-être', 'discussion', 'ici', 'avis', 'extérieur', 'aboutir', 'solution', 'agir', 'bien', 'jargon', 'rien', 'fusion', 'mouvement', 'gothique', 'moindre', 'mal', 'avoir', 'préconiser', 'suppression', 'pur', 'simple', 'remarqu', 'concret', 'perdre', 'temps', 'propose', 'absence', 'opposition', 'appuyer', 'référence', 'redirect', 'mettre', 'place', 'ici', 'semaine', 'fusion', 'effectuer', 'jour'] | ['Sand', 'Darkline', 'Crobard', 'Agarwaen', 'GL', 'Case', 'Enkahel', 'GL', 'Sand'] | 9 | ['2005-11-01T08:30', '2005-11-01T09:05', '2005-11-01T10:41', '2005-11-01T11:36', '2005-11-01T11:55', '2005-11-01T12:11', '2005-11-01T13:32', '2005-11-01T11:55', '2005-11-07T07:41'] | 0 | 0 | 1.285714 | 1.285714 | 2 | 49 | 5.444444 | 0 | 18 | 101 | 11.222222 |
| 2865 | None | 1250500_1 | 1 | ['falloir', 'minimum', 'mettre', 'titre', 'tableau', 'présentable', 'savoir', 'agir', 'savoir', 'nommer', 'colonne'] | ['Isaac Sanolnacov'] | 1 | ['2007-01-03T13:00'] | 0 | 0 | 1.000000 | 1.000000 | 1 | 11 | 11.000000 | 11 | 11 | 34 | 34.000000 |
| 2866 | Janine / Jeannine ? | 1898590_1 | 1 | ['regardez', 'page', 'http://biographiesartistesquebecois.com/artiste-b/bergeronjano/bergeronjano.html', 'vrai', 'nom', 'épelle', 'Jeannine', 'Janine', 'savoir', 'vous', 'orthographe', 'correct', '-andy'] | ['217.50.59.156'] | 1 | ['2010-12-17T17:41'] | 0 | 0 | 1.000000 | 1.000000 | 1 | 13 | 13.000000 | 13 | 13 | 25 | 25.000000 |
| 2867 | None | 1892380_1 | 1 | ['Bonjour', 'indiquer', 'article', 'figure', 'contre', 'aide', 'mieux', 'comprendre', 'jouer', 'paramètre', 'trouve', 'figure', 'article'] | ['anonyme'] | 1 | [None] | 1 | 0 | 0.000000 | 1.000000 | 1 | 13 | 13.000000 | 13 | 13 | 30 | 30.000000 |
| 2868 | Discordances Wikidata | 10838710_1 | 2 | ['4', 'mai', '2017', 'discordance', 'remarquer', 'donnée', 'article', 'Wikidata', 'point', 'consulter', 'source', 'fiable', 'souhaitable', 'harmoniser', 'donnée', 'corrigeant', 'sourcer', 'article', 'corrigeant', 'sourcer', 'Wikidata', 'expliquer', 'raison', 'divergence', 'travail', 'effectuer', 'catégorie', 'article', 'information', 'diffèrent', 'Wikidata', 'pouvoir', 'enlever', 'catégorie', 'article', 'information', 'diffèrent', 'Wikidata'] | ['anonyme', 'anonyme'] | 2 | ['2017-05-04T15:23', None] | 2 | 0 | 0.000000 | 2.000000 | 2 | 38 | 19.000000 | 5 | 33 | 84 | 42.000000 |
| 2869 | None | 9334600_1 | 1 | ['avertissement', 'Homonymie', '|', 'revisionid=117142228', '|', 'Pédicelles'] | ['anonyme'] | 1 | ['2015-08-06T22:26'] | 1 | 0 | 0.000000 | 1.000000 | 1 | 6 | 6.000000 | 6 | 6 | 6 | 6.000000 |
| 2870 | None | 3136640_1 | 1 | ['traduire', 'page', 'permettre', 'modifier', 'organisation', 'page', 'original', 'mieux', 'séparer', 'texte', 'incertitude', 'rapport', 'mot', 'aide', 'bienvenue', 'voir', 'page', 'suivi', 'traduction', 'détail'] | ['Eldorino'] | 1 | ['2008-07-17T00:17'] | 0 | 0 | 1.000000 | 1.000000 | 1 | 20 | 20.000000 | 20 | 20 | 56 | 56.000000 |
| 2871 | Gospel | 295640_1 | 3 | ['version', 'anglais', 'donne', 'read', 'fiery', 'gospel', 'writ', 'burnished', 'steel', 'traduction', 'français', 'lire', 'ardent', 'texte', 'gospel', 'écrire', 'lisse', 'ligne', 'acier', 'traduire', 'lire', 'ardent', 'texte', 'évangile', 'écrire', 'lisse', 'ligne', 'acier', 'sembler', 'logique', 'non', 'oui', 'mieux', 'traduction', 'partie', 'inspirer', 'http://64.233.167.104/search?q=cache:qWacnGEXtMMJ:laurentmalet.free.fr/mariage/Ceremonie.html++%22J%27ai+lu+un+ardent+texte+de+gospel+%22hl=frlr=lang_fr', 'ici', 'falloir', 'mentionner', 'gospel', 'traduire', 'évangile', 'convier', 'modifier', 'retraduire', 'texte', 'entier', 'y', 'chose', 'modifier', 'vocabulaire', 'grammaire', 'expression', 'couplet', 'faire', 'lecteur', 'mmf--'] | ['Revas', 'ADM', '83.158.47.186'] | 3 | ['2005-06-18T08:21', None, '2011-03-09T11:37'] | 0 | 0 | 1.000000 | 1.000000 | 1 | 58 | 19.333333 | 13 | 31 | 119 | 39.666667 |
| 2872 | Trois Milliards de gens sur terre | 295640_2 | 1 | ['Mireille', 'Mathieu', 'performed', 'melodie', 'Wikipedia'] | ['anonyme'] | 1 | [None] | 1 | 0 | 0.000000 | 1.000000 | 1 | 5 | 5.000000 | 5 | 5 | 6 | 6.000000 |
| 2873 | Reprise dans la fiction | 295640_3 | 1 | ['morceau', 'apparaître', 'jeu', 'vidéo', 'fallout', '3', 'musique', 'diffuser', 'radio', 'Enclave', 'descendre', 'gouvernement', 'officiel', 'américain'] | ['anonyme'] | 1 | [None] | 1 | 0 | 0.000000 | 1.000000 | 1 | 14 | 14.000000 | 14 | 14 | 25 | 25.000000 |